# Mathematical Reasoning Enhancement

## UniReason Qwen3 14B RL I1 GGUF
*Apache-2.0 · Large Language Model · Transformers · English · mradermacher · 302 downloads · 1 like*

UniReason-Qwen3-14B-RL is a quantized (GGUF) model applicable across multiple domains, particularly strong at text generation and mathematical reasoning tasks.
## The Techer
*Large Language Model · Safetensors · shiviklabs · 850 downloads · 0 likes*

A fine-tuned version of Qwen3-1.7B that strengthens mathematical reasoning through one-shot reinforcement learning with verifiable rewards (RLVR), performing strongly on math benchmarks and coding tasks.
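RLVR-style training relies on a reward that can be checked programmatically rather than scored by another model. Below is a minimal sketch of such a verifier for math answers; the `\boxed{}` answer convention and the function names are illustrative assumptions, not taken from this model's actual training code:

```python
import re
from typing import Optional

def extract_final_answer(completion: str) -> Optional[str]:
    """Pull the last \\boxed{...} answer from a completion (a common math-benchmark convention)."""
    matches = re.findall(r"\\boxed\{([^{}]*)\}", completion)
    return matches[-1].strip() if matches else None

def verifiable_reward(completion: str, ground_truth: str) -> float:
    """Binary reward: 1.0 if the extracted answer matches the reference exactly, else 0.0."""
    answer = extract_final_answer(completion)
    return 1.0 if answer is not None and answer == ground_truth.strip() else 0.0

print(verifiable_reward("so the result is \\boxed{42}", "42"))  # 1.0
print(verifiable_reward("I think it's \\boxed{41}", "42"))      # 0.0
```

Because the reward is computed by exact checking, it cannot be gamed the way a learned reward model can, which is the core appeal of the RLVR setup.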
## AceReason Nemotron 7B
*Other · Large Language Model · Transformers · nvidia · 4,278 downloads · 10 likes*

A math and code reasoning model trained with reinforcement learning on top of DeepSeek-R1-Distilled-Qwen-7B, excelling at mathematical and code reasoning tasks.
## QwQ Bakeneko 32B
*Apache-2.0 · Large Language Model · Transformers · Japanese · rinna · 1,597 downloads · 17 likes*

A Japanese dialogue model built by merging Qwen2.5-32B and QwQ-32B, further enhanced with Chat Vector and ORPO techniques for improved instruction following.
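Chat Vector transfers instruction-following by weight arithmetic: subtract a base model's weights from its instruction-tuned variant, then add that delta to another model. A toy sketch with scalar stand-ins for weight tensors (all values are illustrative; real merges apply this tensor-by-tensor across full checkpoints):

```python
# Scalar stand-ins for per-tensor model weights (illustrative values only).
base_en = {"w": 1.0, "b": 0.5}    # base pretrained model
chat_en = {"w": 1.5, "b": 0.25}   # its instruction-tuned counterpart
base_ja = {"w": 0.75, "b": 0.5}   # continually pretrained target (e.g. Japanese)

# The "chat vector" is the delta that instruction tuning added to the base.
chat_vector = {k: chat_en[k] - base_en[k] for k in base_en}

# Adding the delta to the target transfers chat behavior without retraining.
merged = {k: base_ja[k] + chat_vector[k] for k in base_ja}
print(merged)  # {'w': 1.25, 'b': 0.25}
```

The arithmetic only makes sense when all three models share the same architecture and tokenizer, which is why the technique pairs models from the same family (here, Qwen2.5-32B and QwQ-32B).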
## ThinkEdit DeepSeek Llama3 8B
*MIT · Large Language Model · Transformers · cesun · 55 downloads · 2 likes*

ThinkEdit is a lightweight weight-editing method that identifies and modifies a small number of attention heads to mitigate the overly short reasoning chains produced by reasoning models, thereby improving reasoning accuracy.
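The kind of targeted edit ThinkEdit describes can be pictured as rescaling only the output-projection rows that belong to a few selected attention heads, leaving the rest of the weights untouched. The sketch below shows that mechanics on toy matrices; the head selection and scale factor are illustrative assumptions, not the paper's actual procedure:

```python
NUM_HEADS, HEAD_DIM = 4, 2
# Toy output-projection matrix: rows [h*HEAD_DIM, (h+1)*HEAD_DIM) belong to head h.
W_o = [[1.0] * 3 for _ in range(NUM_HEADS * HEAD_DIM)]

def edit_heads(W, heads_to_dampen, scale=0.5):
    """Down-scale the output-projection rows of selected heads; all other rows stay intact."""
    W = [row[:] for row in W]  # edit a copy, not the original weights
    for h in heads_to_dampen:
        for r in range(h * HEAD_DIM, (h + 1) * HEAD_DIM):
            W[r] = [x * scale for x in W[r]]
    return W

# Suppose head 1 was identified as driving overly brief reasoning.
edited = edit_heads(W_o, heads_to_dampen=[1])
```

Because only a handful of rows change, the edit is cheap to apply and to revert, which is what makes the method "lightweight" relative to fine-tuning.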
## Open Reasoner Zero 32B
*MIT · Large Language Model · Transformers · Open-Reasoner-Zero · 498 downloads · 29 likes*

The first open-source implementation of large-scale reasoning-oriented reinforcement learning, focused on scalability, simplicity, and ease of use.
## Granite 8B Code Instruct 4K
*Apache-2.0 · Large Language Model · Transformers · Other · ibm-granite · 1,481 downloads · 110 likes*

Granite-8B-Code-Instruct-4K is an 8-billion-parameter code model fine-tuned from Granite-8B-Code-Base-4K on a variety of permissively licensed instruction datasets, strengthening its instruction-following ability, including logical reasoning and problem solving.
## Granite 3B Code Instruct 2K
*Apache-2.0 · Large Language Model · Transformers · Other · ibm-granite · 1,883 downloads · 36 likes*

Granite-3B-Code-Instruct-2K is a 3-billion-parameter model fine-tuned from Granite-3B-Code-Base-2K with enhanced instruction-following capabilities, excelling in code generation and logical reasoning tasks.
## MathGenie InternLM 20B
*Apache-2.0 · Large Language Model · Transformers · Multilingual · MathGenie · 32 downloads · 8 likes*

MathGenie enhances the mathematical reasoning capabilities of large language models by generating synthetic training data through question back-translation.
## CodeLlama 7B HF ReFT GSM8k
*Large Language Model · Transformers · lqtrung1998 · 38 downloads · 1 like*

A CodeLlama fine-tune that improves the reasoning generalization of large language models through reinforced fine-tuning (ReFT), suited to code generation and comprehension tasks.
## Math Shepherd Mistral 7B RL
*Large Language Model · Transformers · peiyi9979 · 44 downloads · 6 likes*

A math problem-solving model trained with Math-Shepherd's step-by-step reinforcement learning, performing strongly on the GSM8K and MATH datasets.